CDS

Accession Number TCMCG033C13854
gbkey CDS
Protein Id TQE00588.1
Location join(717721..717832,717931..718007,718097..718152,718244..718316,718429..718540,719043..719116,719441..719515,719643..719774)
Organism Malus baccata
locus_tag C1H46_013832
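
Note: the Location field above uses the GenBank-style join(...) syntax, in which each start..end pair is a 1-based, inclusive exon segment. As a minimal sketch (the helper names join_segments and spliced_length are hypothetical, and the code assumes a forward-strand location with no complement() or partial-end markers), the spliced CDS length can be checked against the reported protein length like this:

```python
import re

def join_segments(location: str) -> list[tuple[int, int]]:
    """Return (start, end) pairs found in a GenBank-style location string."""
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]

def spliced_length(location: str) -> int:
    """Sum segment lengths; coordinates are 1-based and inclusive."""
    return sum(end - start + 1 for start, end in join_segments(location))

location = (
    "join(717721..717832,717931..718007,718097..718152,718244..718316,"
    "718429..718540,719043..719116,719441..719515,719643..719774)"
)

segments = join_segments(location)
total_nt = spliced_length(location)

# One codon is the stop codon, so it is subtracted from the residue count.
print(len(segments), total_nt, total_nt // 3 - 1)  # 8 711 236
```

The eight segments sum to 711 nt, that is 237 codons including the stop codon, which agrees with the 236 aa length listed in the Protein section.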

Protein

Length 236aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA428857, BioSample:SAMN08323692
db_source VIEB01000210.1
Definition hypothetical protein C1H46_013832 [Malus baccata]
Locus_tag C1H46_013832

EGGNOG-MAPPER Annotation

COG_category O
Description The proteasome is a multicatalytic proteinase complex which is characterized by its ability to cleave peptides with Arg, Phe, Tyr, Leu, and Glu adjacent to the leaving group at neutral or slightly basic pH
KEGG_TC -
KEGG_Module M00340
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000, ko00001, ko00002, ko01000, ko01002, ko03051
KEGG_ko ko:K02738
EC 3.4.25.1
KEGG_Pathway ko03050, map03050
GOs GO:0000502, GO:0005575, GO:0005622, GO:0005623, GO:0005839, GO:0019774, GO:0032991, GO:0044424, GO:0044464, GO:1902494, GO:1905368, GO:1905369

Sequence

CDS:  
ATGGCTTCTTCCGACATGGATCTCAACGCCCCTCACTCCATGGGCACCACCATCATCGGCGTCACCTACGACGGAGGCGTCGTCCTTGGCGCCGACTCTCGTACCAGCACGGGAGTATACGTGGCCAATCGCGCCTCCGACAAAATCACGCAGCTCACCGACAATGTCTACGTCTGCCGCTCTGGATCGGCAGCCGATTCCCAGGTTGTTTCCGACTACGTTCGTTACTTCCTTCATCAGCACACAATTCAGCTTGGGCAACCTGCGACTGTCAAAGTTTGTGCAAACCTCGTCAGGCTGCTGTCCTATGGTAACAAGAATATGTTGGAAACTGGACTTATTGTTGGCGGGTGGGACAAGTACGAAGGTGGTAAGATTTATGGGATTCCTCTTGGTGGCACACTGCTAGAACTGCCCTTTGCCATTGGAGGATCTGGCTCCAGTTACTTGTATGGATTTTTCGATCAAGCATGGAAAGAAGGAATGACCAAGGACGAAGCTGAGCAATTGGTGGTCAAGGCTGTTTCTCTCGCCATTGCACGAGATGGTGCCAGTGGGGGTGTTGTCCGTACTGTAGTTATAAATTCTGAGGGAGTGACAAGAAACTTCTATCCTGGCGACAAACTTCCACTGTGGCATGAGGAGTTGGAGCCTCAGAACTCATTGTTGGACATATTGAACACTGCTAGTCCCGAGCCAATGAACATATGA
Protein:  
MASSDMDLNAPHSMGTTIIGVTYDGGVVLGADSRTSTGVYVANRASDKITQLTDNVYVCRSGSAADSQVVSDYVRYFLHQHTIQLGQPATVKVCANLVRLLSYGNKNMLETGLIVGGWDKYEGGKIYGIPLGGTLLELPFAIGGSGSSYLYGFFDQAWKEGMTKDEAEQLVVKAVSLAIARDGASGGVVRTVVINSEGVTRNFYPGDKLPLWHEELEPQNSLLDILNTASPEPMNI